Learning Reliable Classifiers From Small or Incomplete Data Sets: The Naive Credal Classifier 2
Abstract
In this paper, the naive credal classifier, which is a set-valued counterpart of naive Bayes, is extended to a general and flexible treatment of incomplete data, yielding a new classifier called naive credal classifier 2 (NCC2). The new classifier delivers classifications that are reliable even in the presence of small sample sizes and missing values. Extensive empirical evaluations show that, by issuing set-valued classifications, NCC2 is able to isolate and properly deal with instances that are hard to classify (on which naive Bayes accuracy drops considerably), and to perform as well as naive Bayes on the other instances. The experiments point to a general problem: they show that with missing values, empirical evaluations may not reliably estimate the accuracy of a traditional classifier, such as naive Bayes. This phenomenon adds even more value to the robust approach to classification implemented by NCC2.
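To make the idea of a set-valued classification concrete, here is a minimal sketch. It is not NCC2: it ignores features and missing values entirely and only looks at class counts, assuming the imprecise Dirichlet model with a hypothetical hyperparameter `s` and a simple interval-dominance rule. The function names are illustrative and do not come from the paper or from any library.

```python
from collections import Counter


def idm_interval(count, total, s=1.0):
    """Lower and upper probability of one class under the imprecise
    Dirichlet model (assumption: prior strength s, class counts only)."""
    return count / (total + s), (count + s) / (total + s)


def set_valued_prediction(labels, classes, s=1.0):
    """Return every class that is not interval-dominated.

    A class is dropped only if some other class has a lower probability
    strictly greater than this class's upper probability.
    """
    counts = Counter(labels)
    total = len(labels)
    intervals = {c: idm_interval(counts[c], total, s) for c in classes}
    return {
        c
        for c in classes
        if not any(
            intervals[other][0] > intervals[c][1]
            for other in classes
            if other != c
        )
    }


# Two observations: the intervals overlap, so both classes are returned.
small = set_valued_prediction(["a", "b"], ["a", "b"])

# Thirty observations with a clear majority: a single class is returned.
large = set_valued_prediction(["a"] * 20 + ["b"] * 10, ["a", "b"])
```

With very little data the probability intervals are wide and overlap, so the sketch returns several classes; as evidence accumulates the intervals narrow and the output shrinks to one class. This mirrors, in a much simplified setting, the behaviour the abstract describes for hard-to-classify instances.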
Related resources
Reliable diagnoses of dementia by the naive credal classifier inferred from incomplete cognitive data
Dementia is a serious personal, medical and social problem. Recent research indicates early and accurate diagnoses as the key to effectively cope with it. No definitive cure is available but in some cases when the impairment is still mild the disease can be contained. This paper describes a diagnostic tool that jointly uses the naive credal classifier and the most widely used computerized syste...
Tree-Based Credal Networks for Classification
Bayesian networks are models for uncertain reasoning which are achieving a growing importance also for the data mining task of classification. Credal networks extend Bayesian nets to sets of distributions, or credal sets. This paper extends a state-of-the-art Bayesian net for classification, called tree-augmented naive Bayes classifier, to credal sets originated from probability intervals. This...
JNCC2: An extension of naive Bayes classifier suited for small and incomplete data sets
JNCC2 implements the Naive Credal Classifier 2 (NCC2), i.e., an extension of naive Bayes to imprecise probabilities, designed to return robust classification even on small and/or incomplete data sets, which is often the case in environmental case studies.
Naive Credal Classifier 2: a robust approach to classification for small and incomplete data sets
Naive Credal Classifier, which is an imprecise-probability counterpart of Naive Bayes, is rigorously extended to a very general and flexible treatment of incomplete data, yielding a new classifier called Naive Credal Classifier 2 (NCC2). The new classifier delivers classifications that are robust to the presence of small sample sizes and missing values. In particular, some empirical evaluations...
Lazy Credal Classifier and how to compare credal classifiers
This poster carries out two main contributions: (a) a lazy (or local) version of naive credal classifier (NCC) that we call lazy naive credal classifier (LNCC); (b) two metrics to compare credal classifiers. NCC [1] has extended naive Bayes (NB) to imprecise probabilities, by modeling prior ignorance via the Imprecise Dirichlet Model; the classification is eventually issued by returning the non...
Journal: Journal of Machine Learning Research
Volume: 9
Publication year: 2008